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This publication is divided into four 
sections. 

Section 1 compares the environments 
available to the CE for testing and 
repairing I/O devices. The advantages of 
using the virtual machine as a tool for 
fault analysis is also described. h 
comparison of OLTS (online test system) 
results from both the real and the virtual 
system/370 is also discussed. 

Section 2 discusses the requirements for 
testing I/O devices from a virtual machine 
environment which includes the following: 

• The CE virtual machine 

• How to log onto a virtual machine 

• How to run the online tests 

• Samples of test runs 

This section provides information to 
permit the CE to run diagnostic tests in a 
virtual machine environment from a virtual 
machine console (terminal) . 

Section 3 describes the VM/370 system 
error recovery, error recording, and system 
console error messages, and the control 
blocks used in the error recovery/ 
recording process. 

Section U describes VM/370 facilities 
that allows more detailed information to be 
obtained for problem analysis and repair. 
These include: 

• CPEREP and OS/VS EREP 

• Intensive Recording Mode 

• Trace Option 

• VMFDiJMP 



PREREQUISITE PUBLICATIONS 
IM liEtual Machine Facility/37 0: 
Iati^2duction, Order No. SC20-1800 



Terminal User*s Suide, Order 
GC20-1810 



No, 



If the IBM 3767 Terminal is used, IBM 
U.§2. Terminal Operator* s Guide, Order No. 
GA18-2000, is also prerequisite. 

If the system being serviced makes use 
of the IPCS (Interactive Problem Control 
System) component of VM/370, then the 
YM/370 Interactive Problem Control System 
(IPCS) Dser^s Guide, Order No. GC20-1823 is 
also a prerequisite. 



COREQUISITE PUBLICATIONS 

IM Virtual MaSkiHe Faci li 1 2/3 7 : 

CP Command Reference for General Users, 
Order No. GC20-1820 

System Messages, Order No. GC08-1808 

OS/VS, DOS/VSE, VM/370 Illionmental 
Recording^ Editing and Printing (EREP) 
Program, Order^No. GC28-0772 

OS/VS, DOS/VSE, VM/370 Enii£2ni§nta.l 

Recording, EditM3 §.2:1 PSifiiiaS (5E5£) 
Program Log^ic, Order No. ST28-0773 

OS/VS, DOS/VSE, VMZl^O EREP Messages, Order 
No. GC38-1045 

Figure 1, which follows the Preface, 
shows the relationship of VM/370 
publications to one another within the 
VM/370 Library. 

RELATED PUBLICATIONS 



The following texts, although not required, 
will broaden the CE's knowledge of VM/370 
and virtual machines. 

IBM Virtual Machine Fa ci lit 1/370 : 

Planning and System Gen eration Guide, 
Order No. GC20-1801 

CHS Oser^s Guide, Order Ho. GC20-1819 

Oper ato r's Guide. Order No. GC20-1806 

Remote Spooling Communications Subs yst em 
User's Guid eT Order No. GC20-18167 

III 3704 and 3705 Communications 
Con trollers Eet*2El^ Control Proqram/VS, 
Program Logic Manual, Order No. SY30-3007. 
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In this publication, the term "3330 
series" is ued in reference to the IBM 3330 
Disk. Storage, Models 1, 2, and 11 and the 
IBM 3333 Disk Storage and Control, Model 1 
and 11. 

In this publication, the term "2741" is 
applicable and equivalent to the IBM 3767 
Terminal unless otherwise specified. 



3158 display consoles, when used in display 
mode unless otherwise noted. 

Any information pertaining to the IBM 
3284 or 3286 printer also pertains to the 
IBM 3287, 3288 and 3289 printers unless 
otherwise noted. 

Notes: 



The term "jz/u" is usea in tnis 
publication to refer to a series of display 
devices, namely, the IBM 3275, 3276, 3277 
and 3278 display stations- \ specific 
device type is used only when a distinction 
is required between the device types. 

The term "FB-512" is used in this 

publication to refer to those IBM DASD 

devices that implement fixed block (512 

byte blocks) mode. Specifically, these 
devices are the 3310 and 3370. 

Information about display terminal usage 
also applies to the IBM 3138, 3148 and the 



cause OLTSEP Release 4.0, 4.1, or 5.0 
execution problems; refer to the 
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In attempts to service components of a 3850 Mass Storage System, the 
CE should be aware that the virtual machine is interfacing with 
virtual 3330 volumes (3330V) and not with a real 3330 device; thus, 

tie misapplication o f diagnostics cou ld le ad jt^ erroneous 

interpretations. 

In testing components of the 3850 Hass Storage System (HSS) , most 
functions provided by the online test system (OLTS) require that MSS 
activity be quiesced. To insure a quiesced mass storage system, it 
is recommended that the CE's virtual machine be run in a standalone 
environment. 

The CPOID found in the error recording records is the CPOID 
associated with the real machine and not the one associated with a 
virtual processor. 

If the facilities of an IBM 3850 Mass Storage System (MSS) are used 
with VM/370 virtual machine operations and MSS errors are reflected 
to VM/370 *s error recording area, CPEREP must be invoked so that 
MSS-related errors recorded in the error area can be directed to an 
accumulation (ACC=Y) tape for further processing by the VS1/VS2 
Subsystem Data Analyzer (SDA) Program. Because MSS logged out data 
is voluminous and the interrelationship of MSS components is complex, 
it is imperative that this service program be used to effectively 
diagnose and isolate mass storage problems. 

The virtual machine used by the CE normally does not have a dedicated 
high speed printer. Therefore, long listings (such as console 
spooling records, dumps, error recording records, and diagnostic 
output tabulations) are queued to a common spool output device along 
with the files generated by users of other virtual machines. These 
files are queued by class as well as by the time at which the files 
are closed. If the queue for output is long or contains files that 
are sequentially ahead of the CE»s output records, the wait for 
output could be quite lengthy. However, the system operator can 
alter the order (sequence) of output files, if the need is urgent- 

The I/O configuration of the virtual machine should be such that each 
virtual channel maps to real channels of a single type and model. 
This requirement is explained in detail in "Appendix E. VM/370 
Restrictions" in VM/_370 System Kessa^es. If this requirement is not 
met, the STIDC instruction may return inconsistent results, and any 
data from a channel extended logout may be misinterpreted since it 
depends on the channel model. Also note that there is a restriction 
against using control register 14 to mask out channel extended 
logouts; if this is done in a virtual machine, the logout does not 
remain pending and instead is lost. 

Hardware and software problems in VM/370 can cause the abrupt 
termination of a virtual machine- Such terminations (such as forced 
logoff) cause the register and storage contents of the virtual 
machine to be lost, thus rendering problem analysis ineffective. 
Facilities exist within VM/370 to preserve the register and storage 
contents of prespecified virtual machines on prespecified DASD 
locations so that, when system termination occurs, system analysis 
and system recovery can be accomplished. For details, refer to 
"Virtual Storage Preservation," described in VM/370 System 
££23£amiS£l§ Guide. The saving of virtual machines on a VMSAVE area 
can be adversely affected by malfunctions of the checkpoint, spooling 
and abend dump modules as well as channel check handler and machine 
check handler modules. 
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• The third, fourth, and fifth lines represent the virtual unit record 
spool devices and addresses that are mapped by the system to 
equivalent real devices = The letter B located on line four and the 
letter A located on line five represent the assigned spool class for 

--- — tfrcrtr--d«vi:eev- - --■- -— - -- - - - - - - - - - - - 

• The sixth line is an entry describing a 2314 minidisk with an address 
of 190 that contains the CHS system residence files on cylinders 000 
to 050. This disk is labeled CHS 190 and this user has read-only 
(R/0) access to this disk, since it is the CHS system disk, 

• The seventh line is interpreted the same as the sixth line with the 
following exceptions: only five cylinders are allocated to the user 
of the 231U volume labeled CHS001. However, the write (W) access 
privilege allows the CE to write routines or modify existing routines 
and store them permanently on this disk. 

For further details on VH/370 user directory entries, see the YH/370 
Plajaning a n d System Generation Guide. 
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Command Privilege Class for the CE 



The CE*s virtual machine is similar to other virtual machines running 
under VM/370, The CE*s virtual machine reacts to the System/370 machine 
instruction set in much the same manner as on a dedicated System/370. 
Control of the virtual machine is through a terminal and CP commands. 
These commands are grouped into eight privilege classes. Each class 
relates to specific system functions. The privilege class or classes of 
commands assigned to a particular virtual machine are stored in the 
7M/370 directory along with the user's virtual machine identification 
code and password. 

As a user of a virtual machine, it is assumed that the CE has the 
class G and F commands and CMS allocated for his use. CHS is discussed 
briefly in the VM^370 Introduction. CMS is important to the CE because 
this environment must be entered to execute the CPEREP command. CPEREP, 
when invoked, calls EREP modules that format and print error recording 
data; optionally, CPEREP may be used to create an accumulation tape 
(ACC=option) or edit an existing accumulation tape (HIST option) ; even 
SYS1.L0GREC data sets on tape or disks compiled from other systems may 
be used. If the CE in a remote location has access to any of the remote 
terminals supported by the RSCS component of VB/370, he may utilize the 
facilities of RSCS to transfer bulk data, such as trace output and error 
recording printouts, to a remote printer. Remote spooling procedures 
are described in the VMZ1Z2 S§CS Oser^s Guide. 

The use of CPEREP is also important in relation to its use with the 
3850 Mass Storage System. Errors accumulated on the VM/370 error 
recording area must be placed in the CPEREP accumulator output tape for 
additional processing and analysis by the VS1/VS2 subsystem data 
analyzer (SDA) program. For details on how this is accomplished, refer 
to CPEREP and OS/VS EREP in Section 4 (for a description on how to 
create an output tape) and then refer to either OS/VSl SYSl. LOGREC 
lIE2£ E§cording, Order No. GC28-0668 or 0S/VS2 System ProgramminaLibrary 
SYSl. LOGREC Error Recordina, Order No. GC28-0677 for details. 

The class F commands include the SET RECORD and SET MODE commands. 
With these commands, the CE can set requirements for intensive or soft 
error recording. Refer to "Section 3. Additional CE Aids." Class F 
allows the CE to void error recording that occurs as a result of the 
CE's virtual machine activity except for the device and condition 
specifically named in the SET RECORD command. 

Class F also allows the CE to generate trace data for a specified 

3704 or 3705 BTU (basic transmission unit) or resource by means of the 

NETWORK TRACE command. The produced trace data is then spooled to the 

I CE's virtual printer. Class F is also necessary for access to the *CE 

I area' on FB-512 devices. The size and location of this area is 

j described in the particular device reference publication. Class G 

commands comprise a complete set of commands for virtual machine use. 

In addition to the Class F and G commands, there are commands that 
are not confined to any assigned command category. These commands, 
referred to as the class "Any" commands, can be invoked regardless of 
logon status. Examples of these commands are MESSAGE and LOGON. 

This book illustrates the use of only those VM/370 commands necessary 
for CE applications. However, if additional help is necessary, the CE 
can solicit help from the system operator via MESSAGE OPERATOR command, 
or use the VM^370 CP Command Reference for General Users, the VM/370 
02§£atorls Guide, and VM/370 CMS Command and Macro Reference. 
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If the CE receives "virtual machine already in use," or "...exceeds 
system parameters" in addition to "vm/370 online" and possibly RESTART, 
use the MESSAGE command and contact the system operator for assistance. 



To invoke the MESSAGE comffland for cdmmunication tO" — the — system 
operator any of the following forms may be used: 

message operator message-text 
msg op message-text 
m op message-text 

If a message response from the system operator is not forthcoming or 
the message cannot be entered via terminal equipment, then other media 
must be used to establish communication with the system operator. 

If an acknowledgement of the message is received by the CE, then line 
and terminal coiiunication have been successfully established. 



OSING THE LOGON COMMAND 

If line and terminal performance is satisfied, failure to log on can be 
the result of improper use of the LOGON command and its associated 
operands. The correct procedure involves knowing the correct password 
and CE userid. 

Assume that the LOGON was invoked correctly but the response was a 
facsimile of one of the following: 

MAXIMUM USERS EXCEEDED 

INVALID OSERID 

DSERID NOT IN CP DIRECTORY 

PASSWORD INCORRECT 

ALREADY LOGGED ON LINE raddr 

CE action should be to relay this data via the MESSAGE command back 
to the system operator. The system operator can then defer maintenance 
to a later time period, or can arrange an environment so that CE LOGON 
is successful. Once logon is successful, the CE can use OLTSEP and the 
online test sections (OLTS) . Samples of invoking OLTSEP are shown later 
in the text. 

To assist in the process of entering the tests or other data, the CE 
can use VM/370' s four input line edit functions; they are described in 
the VM^370 Terminal User's Guide. Briefly, an a symbol when entered 
deletes the previously entered character on the logical input line. The 
^ symbol deletes the previously entered input line. The # symbol is 
used to signal the end of a logical input line so that multiple logical 
input lines can be entered on the same terminal input line. The " 
symbol is issued as an escape character, that is, it cancels the line 
edit function of a following 3, #, ^, or " character and allows that 
line edit function character to be accepted as data. 

After successful logon, the CE must enter the environment needed to 
perform the function he desires. To store or display storage or 
registers in the virtual machine, the environment to use is CP; to 
invoke CPEREP to edit error recording, the CE must first perform an IPL 
and enter the CMS environment. To use the online test sections, the 
OLTSEP program must be loaded into the virtual machine. Details on 
logon, the initial program load (IPL) operation, and the virtual logoff 
process (ending the terminal session) are described in VM/370 Terminal 
User's Guide. 
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Invoking OLTS 



To load the OLTSEP and OLTS programs from a tape or a disk, the CE must 
have the operator attach the IPL device containing the tests to his 
virtual machine. This can be accomplished by asking the operator, or, 
if the CE is at a remote location, the CE can communicate by sending him 
messages on a terminal such as the following: 

msg operator mount my diagnostic pack on 181 - ce 

msg operator put scratch tape on 583 - test device 

The operator will then mount and make ready the devices desired for 
testing by the CE. The operator then issues the ATTACH command; the 
CE's terminal then indicates: 

DEV 181 ATTACHED 
DEV 583 ATTACHED 

In the case of system-owned volumes (DASD devices) that cannot be 
directly attached to the CE»s virtual machine, testing is facilitated by 
defining the device as a full extent minidisk with a relocation factor 
of (that is, the DASD device is described in the system with its 
■inimua and maximum cylinder or block values) . The CE can then use the 
LINK command to link to the device (via password identification) in 
write mode to execute the prescribed tests. 

Under these conditions, the diagnostic used must confine its write 
operation to the CE cylinders only. Dse of system owned disks by the CE 
can be achieved by directory entry in the CE's virtual machine or by the 
use of the LINK command. 

The CE is now ready to load his virtual machine with OLTSEP. This is 
done by issuing the IPL command to the addressed device. Upon 
completion of the operation, OLTSEP responds to the CE»s terminal as 
though he were using the real system console (3215, etc.) . Figure 5 
shows a sample of the complete logon operation, OLTS testing, and logoff 
operation as initiated from a 2741 console. The 2741 sample session 
shown in Figure 5 would suffice for diagnostics run from a display 
terminal. The major difference is that the exclamation point is not 
indicated on the screen's output area; instead, a change in screen 
status information is indicative of attention signaling. 

Notes : 

1. While the execution of OLTS in a virtual machine is usually 
identical to execution on a real machine, differences exist for 
specific types of test devices. Terminal control devices (2701, 
2702, 2703, 3704, and 3705) often appears to respond differently to 
tests executed in a virtual machine. A control run should be 
executed against a device that is known to be operating correctly, 
and the error shown should be considered the normal results when 
the OLTS are run in virtual machine. 

2. If the OLT section selection (DEV/TEST/OPT) defines the same 
terminal that is serving as the virtual system console, refer to 
the topic, "Invoking OLTS to Virtual Machine Console Terminals." 
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§.10. 1§. SM^llisa of Channel Errors 

Channel errors are handled differently from device errors. CP records a 
c:hanlieX~c:hec1c 1^^ recording area-iTmire-diateiy ctudr ±n:forTiis- 

the VM/370 system operator of the channel check via a console message 
(but for a channel data check, no message is issued) . Then CP reflects 
the channel check to the virtual machine. After seeing the error, the 
operating system in the virtual machine issues SVC 76. Since CP has 
already recorded the error, CP ignores the SVC 76 and reflects it to the 
virtual machine (without translating virtual channel and device 
addresses in the error record to real addresses) . The reflected SVC 76 
then causes the operating system in the virtual machine to record the 
channel error in its own LOGREC data set. 



SVC 76--Parameter Passing 

VM/370 examines the contents of general registers and 1 to determine 
if valid conditions exist for handling the error recording data. 

If the system is OS (Release 21 or above) , VS/1, VS/2, or VM/370 (in 
a virtual environment) then: 

General register = two's complement of the error record length 
General register 1 = address of the record 

If the system is DOS (Release 27) or DOS/VS then: 

General register = address of the error record minus 8 
General register 1 = Byte 0, Bit must be a 1 , 

Bytes 1, 2, and 3 contain the CCB address 

(DOS control block for I/O) 

VM/370 then locates the formatted error record and examines the 
record header for a valid operating system identity (ID) . The record 
type then examined to determine if it is one of the supported recording 
types. 



Section 3. Error Handling 41 



Pg. of GC20-1809-7 Rev June 29, 1979 by Supp SD23-9005-1 for 5748-XE1 

Record Modification for VM/370 Error 
Recording 



The error record is modified, changing virtual information to real. The 
fields modified vary with the type of record. 

• Type 30, OBR (Outboard Recorder) 

Common Fields: 

EEiia^I and Alternate CUi are replaced with the real device address 
corresponding to the virtual device address. 

CPOID {CPU model number) is replaced with the real machine model 
number. 

JOBID is replaced with the virtual user ID. 

2§li2§ Dependent Fields: 

For dedicated D&SD devices no modification is required. For 
nondedicated DASD devices, the following modifications are required: 

Seek Add£§ss, the relocation factor, found in the VDEVBLOK, adjusts 

the seek address field of the record in order to reflect the true 

I real seek address if the DASD device is count-key-data device; or 

I adjusts the physical block number if the device is a fixed block 

I device. 

Home Address Read, the relocation factor, found in the VDEVBLOK, 
adjusts the home address read field in order to reflect the true 
real home address. 

Volume ID, the volume label in the RDEVBLOK, replaces the volume ID 
in the record. 

3330, 3340, 3350, and 2305, the relocation factor in the VDEVBLOK, 
adjusts the cylinder address portion of the sense data (sense bytes 
5 and 6) . 

Virtual 23J.1 on 2314, the device type is changed to 2314 and sense 
byte 3 is altered to reflect 2314 information. For this situation, 
the 2314 module ID usually found in the sense byte is not 
available. 

Note: The failing CCW and CSH fields are not altered. This results 
in the CCJi address in the CSW and data address in the CCW being 
virtual, not real. 

• Type 40, 41, 42, 44, 48, and 4F programming abend records: 

Common Fields: 

CPOID (CPO model number) is replaced with the real machine model 
number. 

JOBID is replaced with the virtual user's ID. 
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RDEVBLOK to determine if intensive recording mode is in effect for this 
device. If the conditions are met, an I/O error record is created. 



. -^ a _ 3 



^ 1- ^ . 



Control is returned to the I/O supervisor, which reflects the error to 
the user of the" I/O" o^ 
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Figure 12. Control Block Relationship for SDR Counter Update 
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I/O Error Recording and Error Recording Area 



The error recording facilities of VM/370 format and record outboard 
error records, and record formatted machine check and channel check 
rpcori^s nre^A + ^d hv thf» RM?s ronri n{=.=: nf V.M/170. 



j.uucjL.iic:o yji. 



The error recording routines of VM/370 do not actually perform I/O 
operations. Instead, the I/O error routines treat the error recording 
area allocated on the VM/370 system residence pack as a logical 
extension of VM/370 storage. These extensions of VM/370 storage are in 
the form of logical pages that can be read and written out of by the 
paging supervisor of VM/370. The error recording routines place 
multiple error records within a page; when an error record is assembled 
within a page, a pointer is updated to indicate the beginning of any 
unused area. The next error record is checked to see if it can be 
contained in the remainder of this page. If it can, the error record is 
read into the page and the pointer is updated to again reflect any 
residual storage available for the next error record. This process 
continues until an error record is encountered that cannot be contained 
within the page. Hhen this happens, the page is scheduled to be read 
out to the next available slot in the error recording area and a new 
page in storage is assigned to accept and retain the error record. The 
process continues in like manner, 

I On count-key-data devices, the error recording area is from two to 

I nine adjacent cylinders assigned on the system residence volume. The 

starting cylinder number and number of cylinders are specified in VM/370 

I generation procedures. On FB-512 devices the error recording area is 

I any number of adjacent pages assigned on the system residence volume, 

I The starting page number and the number of pages are specified in the 

I VM/370 generation procedures. In any case when the error recording area 

is 90% full, and again when 100% full, the I/O error routines instruct 

the VM/370 system operator to invoke the CPEREP command to print (or 

create a tape of) the error data and erase the recording area. Errors 

are recorded in the order of occurrence until the allotted space is 

exhausted. 

With the support provided for the 3031, 3032, and 3033 processors, 
CPEREP need not be aware of the content or the EC level of the processor 
logouts in order to format machine check and channel check records. 
Format and content information is provided via the SRF (Service Record 
File) device. Frames (records containing text and scan buffer codes) 
are maintained on the SRF device by customer engineering, and software 
makes use of these frames to interpret and format inboard errors. 
Whenever the VM/370 error recording cylinders are formatted on a 3031, 
3032 or 3033 processor, the SRF is accessed and the frames are 
retrieved, formatted as frame records, and recorded at the beginning of 
the VM/370 error recording area by the process described above. When 
CPEREP is invoked, these frame records are used to format MCH and CCH 
records for the printed report. 

The SRF device is accessed by VM/370 to read frame data (a) during 
VM/370 system initialization if the error recording cylinders have not 
been previously formatted; and (b) as a result of running CPEREP with 
the CLEARF operand. To ensure that the VH/370 control program has 
access to the SRF device, the following steps should be followed to 
activate the SRF: 

1. Check that the I/O interface for the service support console is 
enabled. 
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2. Obtain the configuration frame on the service support console. 

3. The SET appears disabled until accessed on the 3032. Activate the 
SR F on t he 3031 and 3033 by selecting SRF mode A2. 
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4. VARY ON cuu (SRF address) on the operator's console. 

5. ATTACH cuu * cuu to attach the SHF device to the 
console^^— OIL ATTACH_cuu_ userid cuu to attach the SRF device to the 



console of the class F user who runs CPEEEP. 



In a 303x environment, access to the SRF device by an SCP in a 
virtual machine must be considered when planning to run EREP to print 
the error log belonging to that virtual machine. The SRF device must be 
accessible to the operating system in a virtual machine when it 
initializes its error log in order that frame data may be read from the 
SRF. The VM/370 system operator should attach the SRF device to the 
virtual machine before that SCP initializes its error log (for example, 
in the case of 0S/VS2, before running IFCDIPOO) ; the virtual machine 
operator should then vary the SRF online. In single processor mode, the 
SRF device of the VM/370 processor must be attached to the HVS V=R 
virtual machine before «?S runs IFCDIPOO to initialize SYS1.L0GREC. 

The error recording facilities of VM/370 are of the following types: 

OOTBOARD RECORDING; 

• Statistical data recording 

• Permanent I/O errors 

• Environmental data records 

• Intensive mode recordings 

• Specific DASD recording requirements 

• Specific tape recording requirements 

• Software abend records 

IIBOARD RECORDING: 

• Machine checks 

• Channel checks 

I/O Statistical Data Recording (SDR) 

Statistical data recording is the accumulation and the recording of I/O 
error statistics that relate to specific devices. VM/370 supports SDR 
recording for CP-initiated I/O events by building and maintaining device 
statistics tables (counters) in the SDRBLOK associated with the I/O 
device. These counters are updated when a device-dependent error 
recovery procedure (ERP) determines that the error has either been 
corrected successfully or is a permanent error. SDR counters are 
updated based on the sense information in the original lOERBLOK. The 
updating of the counters is done asynchronously. If the update function 
causes a counter overflow, a short OBR record is built. The OBR record 
is then placed on the asynchronous output queue. This causes the OBR 
record to be written on the error recording area asynchronously. 

When the SHUTDOWN command or NETWORK SHUTDOWN command is issued by 
the system operator, any devices that have SDR counters associated with 
them cause control to be passed to the I/O error recording routine to 
format a short OBR record. (A long OBR is formatted for 3400 tapes.) 

The VARY OFFLINE command or NETWORK VARY OFFLINE command of a device 
that has associated SDR counters also causes control to be passed to the 
I/O error recording routine to format a short OBR record (a long OBR is 
formatted for 3400 tapes) . 

The VARY OFFLINE, SHUTDOWN, NETWORK VARY OFFLINE, and NETWORK 
SHUTDOWN commands result in an OBR record being written to the error 
recording area synchronously. 
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Permanent I/O Error Recording 



Permanent I/O errors related to VM/370- initiated I/O events are recorded 
by the I/O error recording routines of VM/370. When a device-dependent 
error recovery procedure determines that an I/O event cannot be 
successfully recovered, the fatal flag is turned on in the lOBLOK and 
control is returned to the I/O supervisor. The I/O supervisor invokes 
the I/O error recording routines sith the control block struct ure as 
shown in Figure 13. The I/O error recording routines format the error 
and record it on the error recording area. 
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Figure 13. Control Block Linkage — Fatal Error Condition 



Environmental Data Recording 



When the I/O supervisor receives a unit check interruption from a 333 0^ 
33'»0, 3350, or 2305, the count-key-data error recovery procedure is 
invoked. If the unit check is from an FB-512 device, that error 
recovery procedure is used. In any case, if the sense information 
indicates that an environmental data recording is required, the error 
recovery procedure builds the necessary channel program to retrieve the 
error log data from the file control unit. 

The sense data that indicates this condition is as follows: 



Machine 

2305 ~ 

3330, 3340, 3350, FB-512 



Sense B^te Bit Condition 

~2 ~ Buffer Log Full 
2 3 Environmental Data 



The manner in which the error recovery procedure passes the data to 
the I/O error recording routine differs between the 2305 and the 
3330/3340/3350/FB-512 as shown in Figures 14 and 15, respectively. 
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Figure 14. 2305 Control Block Structure — Environmental Data Recording 

The lOEREXT field in the lOERBLOK contains the length in doublewords 
of the extended data area. The I/O error recording routine builds an 
environmental data record in the proper format, queues the request for 
recording, and returns to the I/O supervisor. The DASD error recovery 
procedure retries the operation and normal processing continues. 

A different control block linkage exists on 3330/3340/3350 
environmental data recordings due to the amount of data. The DASD error 
recovery procedure builds multiple lOERBLOKS and chains them together to 
pass the data to the I/O error recording routines. 
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I Figure 15. 



3330/3340/3350/FB-512 Control Block Structure—Environiental 
Data Recording 



The lOERLOC pointer in the lOERBLOK points to the next lOERBLOK on 
the string. The error recovery procedures obtain free storage and 
construct lOERBLOKs to be placed on the string until the buffer on the 
control unit is completely unloaded. The I/O error recording routine 
builds an environnental data record in the proper format, queues the 
request for recording, and returns to the I/O supervisor. The error 
recovery procedure retries the operation and normal processing 
continues. 



Intensive Mode Recordings 



On any unit check occurrence the I/O supervisor invokes the I/O error 
recording routines to determine if the conditions for intensive mode 
recording are satisfied. Intensive mode is an error recording mode 
whereby errors are recorded for a specific device that achieves a unit 
check condition and sense data that matches previously defined sense 
data values. The SET RECORD command starts intensive mode. If 
intensive mode recording conditions are satisfied, an I/O error record 
is constructed, formatted, and recorded in the I/O error area of the 
VM/370 system residence device, and a flag is set in the lOERBLOK to 
indicate that the error has been recorded (I0ERFLG2 = lOERCEHD) . This 
recording is done for CP-owned devices as well as dedicated devices 
attached to virtual machines. The user who initiated the intensive mode 
operation must run the CPEREP program to retrieve the records created 
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while this option was active. No messages occur to inform either the 
Ya/370 system operator, or the virtual machine user when a recording is 
made or when intensive mode is disabled by the I/O error recording 

routines-after— the- tenth xeconding^ Xnt_ensive__j«_odje„4SET„R.lLC0BD„pj>.ti^^^ 

can be invoked only on one real hardware device at any time and only by 



a uoKsx. mj.\,ix \, i 



Note: For the privilege class F virtual machine all normal error 
recording is suspended except for the 'intensive mode' selected device. 
If however, the F class user invokes SVC 76 to pass a record to CP to 
record, CP will honor such a request. 

VM/370 I/O Error Recordings 



but the privilege class F user (unless intensive mode is specified for a 
particular device) . 

An unrecoverable (permanent) I/O error which was initiated as a 
VM/370 I/O task (CP request) . 

Counter overflow statistics (SDR count) . 

SHUTDOWN and NETWORK SHUTDOWN commands (devices with SDR counters) . 

VARY OFFLINE (devices with SDR counters) , 

2305/3330/3340/3350/FB-512~Equipment Check. 

2305/3330/33a0/3350/FB-512— Busout Check. 

2305/3330/3340/3350— MDR record on BUFFER UNLOAD command (X«A4' or 
X'24») to a nondedicated DASD storage device by a virtual machine. 

33U0 — Seek Check. 

2305/3340/3350--Data Check. 

2305/3330/3340/3350 — Overrun. 

FB-512 — Data Check. 

Error Recording Record Layout 

Error recordings vary in length and format depending on the malfunction 
or the device encountered. Data that relates to channel check, machine 
check, or unit check conditions is arranged in byte-formatted records in 
the error recording area. Figures 16, 17, 18, 19, and 20 describe the 
layout and data length of the fields within defined record types. Use 
Figure 21 with these charts to ascertain the origin of particular fields 
of data. 

The paired alphabetical characters shown in the fields of Figures 16, 
17, 18, 19, and 20 correspond to the location codes in Figure 21. The 
location code in conjunction with the type of error (HC, UC, CC) 
indicates the availability of that data and what data block or function 
contains or generates this data. 
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Figures 22 and 23, using the sane paired alphabetical character 
scheae, describe the 24-byte header that precedes the error record. 
Figure 23 describes the contents and source of the fields indicated in 
Figure 22. 

For additional information on error record layout as used by the CP 
component of VH/370 refer to VMZlZO Data Areas and Control Block Logic. 
For information on the printout format of supported error record types, 
refer to the QS/?S, DOS/YSE, 7H/370 Environmental Recording Edi^iia M5 
Printing (MM) ££22ES1* Support logic for this program is contained in 
the QS^iS, DQS^VSE, Vm2370 Environmental Recording Editing and Printing 
(ESEP) Program Logic. ~ - - - - 
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Figure 23. Header Record Table (Part 1 of 2) 
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Record ID Code 
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05 = 3705 

08 = 2715 

09 = 3340 

OA = 3330-11 

11 = 3350 

12 = 2305-1 1 

16 = 3370 

17 = 3310 

FF = Reserved for IBH Ose 






Record 


Count 


Always 01 


[CD 1 


Date & 


Time 


RECORDER 


ICE 1 


CPU ID 


and Serial 


Store CPO ID 


IDF 1 


Max MCEL Length 


Store CPO ID 


ICG 1 


Version Number 
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Figure 23. Header Record Table (Part 2 of 2) 

VM/370 Recovery Features -- Introduction 

The primary objectives of VM/370' s recovery management support are: 

• To reduce the number of system terminations that result from machine 
malfunctions. 

• To minimize the impact of such incidents. 

The programmed recovery, which accomplishes these objectives, allows 
system operations to continue whenever possible, and records the system 
status for all errors. The MCH (Machine Check Handler) and CCH (Channel 
Check Handler) provide the recovery management functions of VH/370. 



MACHINE CHECK HANDLER 



A machine malfunction can originate from a p 
control storage, or a channel group. When 
work properly, the hardware tries to corre 
machine recovers from the error through its 
machine check interruption notifies the 
handler routine. The machine check handler 
machine operated improperly. Concurrentl 
interruption, the processor logs out fields 
storage. This information describes the cau 
MCH analyzes this information and builds the 



rocessor, processor storage, 
any one of these fails to 

ct the malfunction. If the 

own recovery facilities, a 

appropriate machine check 

records the fact that the 

y with the machine check 
of information in processor 

se and nature of the error, 
machine check record. 



If the machine fails to recover from the error through its own 
recovery facilities, a machine check interruption occurs, and an 
interruption code indicates that the recovery attempt failed. The 
machine check handler then analyzes the data and tries to keep the 
system as fully operational as possible. The cause of the malfunction 
determines what action the machine check handler takes: 
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• Sesuae operations, leaving no adverse effects on the system. 

• Resuae system operations by terminating the virtual machine that was 
int erru'^t ed, * 

• isoiat«- tiie- failure to a page ^nd flag the page- as i nvali^ i>r 
unavailable for use by the paging supervisor. 

• Place the system in a disabled wait state. 

• If the 158 AP or 168 AP operating in attached processor mode had an 
unrecoverable malfunction occur on the attached processor in problem 
program state, resume operations in uniprocessor mode. 

Virtual machines for which VMSAVE (Directory option or SET command 
operand) is enabled normally have their register and storage contents 
saved in the event of certain abend situations. However, the following 
machine errors cause a disabled wait PSW to be loaded and may prevent 
saving the contents of a virtual machine. 

MCIC invalid 

PSH masks, key, program mask or CC invalid 

Floating-point, control, or general registers invalid when CP was in 
control 

System damage 

Timer damage 

CPU clock damage 

Instruction processing damage when CP was in control 

Machine check recursion 



CHANNEL CHECK HANDLER (CCH) 

The channel check handler is a resident program that receives control 
from the I/O supervisor when a real channel error occurs. CCH records 
the error. CCH reflects channel control checks, channel data checks, 
interface control checks, and channel interface inoperative (for a 
dedicated channel) to the virtual machine to allow the SCP in that 
virtual machine to attempt recovery, and/or initiate appropriate 
termination procedures. If CCH determines that system integrity has not 
been damaged, channel errors associated with an input/output operation 
initiated by CP (for example, paging or spooling) are retried by the 
appropriate device-dependent error recovery procedure. 
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If CCH determines that system integrity has been damaged (for 
example, if the channel has been reset, or if the device address stored 
is invalid) , CCH places the system in a disabled wait state and sends a 
message to the VH/370 primary system operator. For the 4331 and 4341 
-P.ro.c_essjDrsj,_ limited cha_nnel logout is still available, but no fixed or 
I/O extended logout area exists. " 

Virtual machines for which VMSAVE (Directory option or SET command 
operand) is enabled normally have their register and storage contents 
saved in the event of certain abend situations. However, catastrophic 
channel errors cause a disabled wait PSW to be loaded and may prevent 
saving the contents of a virtual machine. 



HANDLING OF HARD MACHINE CHECKS 

If a permanent error (hard machine check) occurs on the main processor 
(or attached processor) , the error is analyzed to determine whether or 
not it is correctable by programming, Time-of-day clock and timer errors 
that result in a machine check interruption that are not correctable and 
cannot be circumvented place the real computing system in a disabled 
wait state. 

Uncorrectable or unretryable processor errors, storage errors, and 
storage protect key failures are handled as discussed in the following 
paragraphs. 



Pl2£§§sor Errors 

When a machine check interruption indicates that a processor error 
associated with VM/370 cannot be corrected or retried the system 
operator is informed of the error and the system is put in a disabled 
wait state. All virtual machine users must log on again. If the error 
is associated with a virtual machine, the user is informed of the error 
and the virtual machine is reset, unless it is using the virtual=real 
option. In that case, the virtual machine is terminated, and the user 
must then log on and reinitialize (via IPL) his machine. 

If VM/370 is being run in attached processor mode and an 
uncorrectable error is encountered on the attached processor while 
executing in problem program state, system operation continues in 
uniprocessor mode on the main processor. 



Storage Errors in a Virtual Machine Page 

When the control program (CP) detects a permanent storage error (hard 
machine check) in a real storage page frame that is being used by a 
virtual machine, the frame is marked invalid if the error is 
intermittent, or unavailable if the error is solid. If the page frame 
has not been altered by the virtual machine, a new page frame is 
assigned to the virtual machine and a backup copy of the page is brought 
in the next time the page is referenced. All storage errors are 
transparent to the virtual machine user. 

If the page frame has been altered, VH/370 resets the virtual 
machine, clears its virtual storage to zeros, and sends an appropriate 
message to the user. If the virtual machine is using the virtual=real 
option, it is terminated. In either case, normal system operation 
continues for all other users. 
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§i.2£aae Errors in the CP Mii^iSiis 

Multiple-bit storage errors in the CP nucleus cannot be corrected; they 
cause VM/370 to terminate, (Single-bit storage errors are corrected by 
ECC, as noted above.) 



Storage Protect Kef Failures 

When interittittent storage protect key failures occur, whether associated 
with VM/370 or a virtual machine, the key is corrected and operation 
continues. 

If the storage protect key error is uncorrectable (solid) and is 
associated with a virtual machine, the user is notified and the virtual 
machine is terminated. The page frame is marked unavailable. 
Uncorrectable storage protect key failures associated with VM/370 cause 
the VM/370 system to be terminated. An automatic restart reinitializes 
VM/370. 



HARDLISG OF SOFT MACHINE CHECKS 

Although hard machine cheeks always cause a machine check interruption 
to occur and logouts to be written, soft machine checks are handled in 
one of two operating modes — recording mode or quiet mode, 

• In recording mode, soft machine checks cause machine check 
interruptions and write logouts. 

• In quiet mode, only hard machine checks cause machine check 
interruptions and write logouts. 

The normal operating state of VM/370 for CPU retry reporting is 

recording mode. For ECC (error checking and correction) reporting, the 

initialized (normal) state of VM/370 is model-dependent: quiet mode for 

all VM/370-supported processors except Models t55II and the 165II, The 
initial state for the 155II and 165II is record mode. 

A change from recording mode to quiet mode can occur in one of two 
ways: when 12 soft machine checks have occurred, or when the SET MODE 
BETRY/MAIN QUIET command is executed by maintenance personnel. 

To revert to record mode again, the command SET MODE RETRY/MAIN 
RECORD must be issued. 

In attached processor applications, soft error recording can be set 
or reset for the selected processor if so desired. 

If a soft machine check (a transient error) occurs while the system 
is in recording mode, a machine check record containing information 
about the error is written on the error recording cylinders. This 
record includes the data in the fixed logout area, the date, the time of 
day, and other pertinent data. The operator is not informed that a soft 
machine check has occurred. 

If a transient error occurs while the system is in quiet mode, no 
machine check interruption occurs, and no logouts are written. The 
hardware, which had gained control when the soft machine check occurred, 
returns control to either VM/370 or the problem program, depending on 
which had control at the time the machine check occurred. 
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3. This step applies to attached processor operations only: Next the 
PSA (prefix storage area) of the main processor is printed followed 
by the PSA values for the attached processor if the system was in 
attached processor mode when the abend occurred. 

k-^ F o ll nwing this is data extracted f rom CP*s symbol t ab le (DMKSYH) . 



which contains the storage location of selected entry points for 
the CP system- 

5. The tabulations that follow the symbol table printout are pages 
that are applicable to the real system hardware. These blocks 
represent every channel, every control unit, and every device that 
is represented as available to 7M/370 operations. These blocks are 
designated as RCHBLOK, RCUBLOK, and RDEVBLOK, respectively. Those 
devices that are actively involved with system operations at the 
occurrence of system abnormal ending are indicated by an adjacent 
display of an active lOBLOK- 

6. These blocks are followed by statistics applicable to the spool 
files that are applied to the spooling devices (system reader, 
printer, and punch) . These blocks are designated as spooled file 
blocks (SFBLOK) . If no spooling activity exists, then the VHFDOMP 
output indicates this (as indicated in the following VMFDDMP 
sample) . 

7. The spooled file data is followed by the CORTABLE- This table 
indicates the real address of the four doubleword entries that 
contain pointers to the SWPTABLE, the PAGTABLE, the previous entry 
in queue, and the next entry in queue. Also contained in this block 
are flags to indicate whether the page is on the flush list, the 
free list, or is shared or unavailable,. The CORTABLE printout also 
indicates the user identity and the page assignment at the time of 
the abnormal ending. 

8. After the CORTABLE, there is a progression of data blocks that are 
related to each logged on user. They are listed in the following 
order: the virtual machine blocks (VKBLOCK) , virtual channel blocks 

(YCHBLOK) , virtual control unit blocks (VCOBLOK) , virtual device 
blocks (VDEVBLOK) , and virtual console control blocks (VCONCTL) . 
This is followed by Segment tables. Page tables and Swap tables 

(SEGTABLE, PAGTABLE, SWPTABLE) , respectively that are applicable to 
the associated user*s virtual machine activity. 

Figure 36 illustrates the output of a formatted VMFDUBP operation 
I (uniprocessor mode). Note that if RDEVBLOK 200 is for an FB-512 device, 
I the RECBLOK would be in a different format. For the actual format of an 
I FB-512 RECBLOK, see VM^370 Data Areas and Control Block Logic, Order No. 
I SI20-0884. 
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VM/370 SYSTEM ABEND CODE PRG05: DATE 09/08/72 



TIME 15:13:31 



GREGS 0-7 
8-15 



8-15 
FPRGS 0-1* 
TOO Cl^OCK 
CPU TIMER 



0000003't OOCSClCt 000000<48 00078010 00000000 000237F8 
0O0237DE 0007B668 00000000 O0033't<t8 00023480 00073A08 



00000000 00000000 OOOOOOOO 00000000 00000000 00000000 



00000000 
00012D22 



UUUUUUUtJ 
EFCOOOOO 



00000008 
00072390 



uuuouuuu 

00073930 



OOOOOOOO OOOOOOOO OOOOOOOO OOOOOOOO OOOOOOOO OOOOOOOO 
82636C06 1*5506000 TOO CLOCK COMP 8263E1B3 57000000 
FFFFFFFF CA337000 

CSW OOOOOOOO OOOOOOOO CAW 000158D0 

EXT OLD PSW lOOi* 070D0OOO 00015A56 

SVC OLD PSW 0008 OOOCOOQO OOOOD23E 

PGM OLD PSW 0005 OOOCOOOO 00023!n2 

MCK OLD PSW OOOOOOOO OOOOOOOO 

I/O OLD PSW OQife 07OD0O00 0OO176'»A 



OOOOOOOO OOOOOOOO 



INT TIMER OOOOOEOO 

EXT NEW PSW OOOCOOOO 000009C8 

SVC NEW PSW OOOCOOOO 00000500 

PGM NEW PSW OOOCOOOO 00ailC58 

MCK NEW PSW 00080000 00011000 

I/O NEW PSW OOOCOOOO 00011*080 



OMKPSA 


000000 


DMKPSASV 


- 000500 


DMKPSANS 


- OOOBi*i* 


DMKPSADU 


- 000690 


DMKPSAEX ■ 


- 0009C8 


DMKFEIBM - 


000550 


DMKPSARX 


- oooaiA 


OMKPSAID 


- 000850 


DMKPSARS 


- 000860 


DMKPSARR - 


- 000866 


DMKMCH 


OllDOO 


DMKPRG 


- 011C53 


DMKPRGCT 


- 011D30 


DMKPRV 


- 012120 


DMKPRVCT - 


- 012178 


DMKPRVLG - 


012120 


DMKPRVKY 


- 0127A8 


DMKHVC 


- 012998 


OMKHVOAL 


- 012998 


DMKHVCYL ■ 


- 012EA0 


DMKHVCPC - 


012EA8 


DMKGEN 


- 012F08 


DMKDGD 


- 0130E0 


DMKVAT 


- 013508 


DMKTMR 


- 013E10 


DMKIOS 


011*020 


DMKIOSOR 


- 011*020 


DMKIOSQV 


- 011*020 


DMKIOS IN 


- 011*080 


DMKIOSRW - 


- 01i*7AA 


DMKIOSCT - 


011*210 


DMKRIO 


- 019558 


DMKRIODV 


- 019558 


DMKRIOCU 


- 01BFD8 


OMKRIOCH - 


- 010393 


DMKRIOCT - 


01C518 


DMKRIOCC 


- 01C538 


DMKRIOUC 


- 01C53A 


DMKRIODO 


- 01C53O 


DMKRIOCN ■ 


- 01051*0 


OMKRIOPR - 


01051*8 


DMKRIOPU 


- 010551* 


DMKRIORD 


- 01C55O 


DMKCNS 


- 014A88 


DMKCNS IN • 


- 014A8 8 


DMKCNSID - 


0mE2E 


DMKCNSOF 


- 01i*FE8 


OMKTBL 


- 015088 


DMKRSP 


- 016788 


DMKRSPEX - 


- 015788 


DMKRSPHQ - 


017920 


DMKRSPID 


- 017950 


DMKRSPDL 


- 01791*8 


OMKRSPRD 


- 01791*0 


DMKRSPPR ■ 


- 017930 


DMKRSPPU - 


017938 


DMKRSPAC 


- 017928 


DMKRSPER 


- 017951* 


OMKDAS 


- 017A00 


DMKIOE 


- 01SA88 


DMKCCH 


019098 


DMKSTK 


- 01C56B 


DHKSTKCP 


- 010568 


DMKSTKIO 


- 010586 


DMKDSP 


- 01C5B0 


DMKDSPCH - 


01C580 


OMKDSPQS 


- 01CF08 


DMKDSPRQ 


- OlOFOC 


DMKDSPA 


- 01C5D1* 


DMKDSPB • 


- 01C5F8 


OMKDSPNP - 


OICFIC 


DMKOSPCC 


- 010F20 


DM<DSPAO 


- 01CF21* 


DMKOSPBC 


- 01CF28 


OMKSCH 


- 01D008 


DMKSCHNl - 


01D7D0 


DMKSCHN2 


- 01D7DC 


DMKSOHCT 


- OlDOCO 


DMKSCHPU 


- 0ID7E0 


DMKVIO 


- 010838 


DMKVtOEX - 


01D838 


DMKVIOIN 


- 01DED2 


DMKVIOMK 


- 01E2A1* 


DMKVIOOT 


- 01E29O 


DMKVIOOW ■ 


- 01E2AO 


OMKCCW 


OlEitSS 


DMKCCWTR 


- 01E1*88 


DMKUNT 


- 01F5A0 


DMKUNTRN 


- 01F5A0 


DMKUNTFR ■ 


- 01F5F2 


DMKUNTRS - 


01F886 


OMKVSP 


- 01F9A3 


DMKVSPEX 


- 01F9A8 


OMKVSPCR 


- OIFDCA 


DMKVSPCO • 


- 020330 



Figure 36, Formatted VMFDUMP (Part 1 of 6) 
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(cent symbol) , line edit use 23 



! (attention symbol) 26 
sample usage 31 



# (pound symbol) , line edit use 23 



a (At sign) , line edit use 23 



" (double quotes) , line edit use 23 



A 

abnormal termination (abend) (see also 
VMFDUMP) 
dump 106 
access, virtual machine requirements 11 
address 

alteration 

console 24,28 
areas, error recording (see error 

recording cylinders) 
assistance, system operator 13 
At sign (3) , line edit use 23 
ATTACH, command, usage 24 
attached processor 

operation, results of uncorrectable 

errors 69 
summary of machine check handler action 

68 
system damage 65 
attention 

signaling, sample usage 26 
symbol (!) 

sample usage 26,31 



CCH (channel check handler) 
additional functions 72 
error messages 74 
function 66 
initialization 72 
overview 60 

overview (5748-XX8) 60.1 
overview (5748-XE1) 60.1 
reaction to errors 74 
summary 71,73 

privilege class 14 

/system operator, relationship 13 

use of IPCS 13 

virtual machine 

capabilities/limitations 3,5 

protective features 3 

typical configuration 14 
cent symbol (0) , line edit use 23 
channel check 

effect on virtual storage preservation 

( 574izXX8) 6 1 
effect on virtual storage preservation 

( 57 4 8-XE 1 ) 6 1 
error record layout 56 
handling by S?c 76 41 
reflection to virtual machine 60 
reflection to virtual machine (5748-XX8) 

60.1 
reflection to virtual machine (5748-XE1) 

60.1 
system action 72 
channel check handler (see CCH (channel 

check handler)) 
checks, terminal facility 22 
class, privilege 16 
CHD command, RSCS usage 103 
CMS (Conversational Monitor System) 
prerequisite for CPEH2P 16 
warning, file destruction 17 
codes 

line transmission 19 
wait state 74 
conditions, terminal communication line 22 
console functions, systems, CP command 

equivalency 3 
control block linkage 

environmental data recording 51 
fatal error 50 
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I/O operation 44 

I/O retry 46 

SDR recording 47 

structure for sense byte analysis 45 

2305 environmental data recording 50 

3330/3340/3350 environmental data 
recording 51 
control units, line 18 

correspondence (line transmission code) 19 
CP commands, equivalency to system console 

functions 3 
CP nucleus, storage errors 62 
CPEREP (see also EREP) 

ACCDEV 95 

ACCIN 95 

applications 96 

brief description of use 16,64 

CMS the environment for 16 

command entry 

file entry method 92 
mixed entry method 92 
prompting method 91 

command format 83-84 

consolidation of error recording from 
different systems 97 

DIRECTWK 95 

duplication of VS EREP»s IFCOFFLD 
(offload) function 96 

edit facilities 16,64 

ERSPPT 94 

FILEDEFs 94 

invoked, console entry methods 91 

operand entry, rules 91 

operands, brief descriptions 85-89 

OS/VS EREP overview 80 

OS/VS SREP relationship 79 

publication requirements for use 79 

screening of operands 80 

SERLOG 95 

shared I/O configuration changes 97 

SYSIN 94 

terminal session, anotated console 
listing 92 

TOURIST 95 

type of error records recorded 81 

use with MSS error records 16 

use with VS1/VS2 Subsystem Data Analyzer 
(SDA) Program 16 

user classes 82 

VS OS/EREP record formats 80 
CP-initiated I/O operation, error recovery 

63 
CP-owned volumes, linking to for test 

purposes 24 
cylinders (areas) , error recording 48 



damage to system, recovery attempts 64 
DASD (Direct Access Storage Device) 

environmental data recording, sense data 
50 

error recording conditions 53 
DASD device, testing 24 
data, security 12 
DEFINE, command (CP) , usage 24 
destruction, file 12 
devices, supported, line equipment 18 



diagnostic tests ( se e OLTSEP (Online Test 

Standalone Executive Program)) 
differences/exceptions, OS/VS EREP vs 

VM/370 EREP 82 
double error recording 64 
dump 

device specified by SET command 106 
system 106 



ECHO command 

sample printout 34 

used for terminal checkout 34 
editing 

error records, CPEREP 64 

input line 23 
EREP (see also CPEREP) 

CPEREP equivalency 16,64 

CPEREP relationship 79 

data set requirements 93 

reports, operand requirements 91 
error checking and correction (ECC) 70 
error handling overview 37 
error messages, to operator 44 
error recording 

conditions, specific devices 53 

CP modules used 43 

cylinders (areas) 

editing error records 64 
full condition 48 
virtual machine owned 38 

dedicated devices 39 

differences 39 

edit facilities 16,64 

functions 48 

outboard recordings 48 

record format 53 

relationship of I/O configuration to 
DHKRIO and MSC Table Create Program 39 

SVC 76 63 

types of errors 38 

virtual vs real addressing 39 

virtual vs real device type 39 
error recording cylinders (areas) 

(5248zXX8) 48 
error recording cylinders (areas) 

(5748-XEj) 48 
error recovery 

CP-initiated I/O operations 63 

features, introduction 60 

from soft machine checks 62 

functional 67 

I/O, detailed description 44 

levels 67 

machine check, hard 61 

modes 70 

operator-initiated restart 68 

procedures 63 

processor errors 61 

processor retry 37,67 

protection key errors 62 

storage error 61 

system 67 

system repair 68 

user termination 67 

virtual machine initiated I/O operations 
63 
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errors 

channel check 

effect on virtual storage 

preservation (574 8-XX8) 61 
effect on virtual storage 

preserv atio n (5 748-XE 1)61 

"handling by SVC 76 Tl~" 
reflection to virtual machine 60 
reflection to virtual machine 

(5748-XX8) 60.1 
reflection to virtual machine 

(5748-XE1) 60.1 
system action 72 
correction code (see error checking and 

correction (ECC) ) 
I/O, discussion 37 
machine check 

effect on virtual storage 

preservation (57U8-XX8) 60.1 
effect on virtual storage 

preservation (5748-XEl) 60.1 
system action 67 
messages, CCH, a referral 74 
record fields, source of data 57 
record formats 

channel checks 56 
header layout 58 
machine checks 56 
nonstandard 55 
unit check (short form) 55 
record layout, unit check 55 
record modifications, SVC 76 42 
reflection 37 
types recorded 38 
EXTERNAL, command, example of use 28 



F 

F privilege class 16 
failure (see also errors) 
line 22 
logon 23 
fatal error, control block linkage 50 
FB-512 devices (see fixed block devices) 

(5748- XX 8) 
FB-512 devices (see fixed block devices) 

( 5748-XE 1) 
FILEDEFs for CPEREP 94 
files 

destruction 17 
protection 12 
security 12 
fixed block devices 

accessing CE area (5748-XX8) 16 
accessing CE area (5748-XE1) 16 
environmental data recording 
unit check (5746zl2[8) 50 
unit check (5748-XE1) 50 
error recording cylinders (areas) 

(5748-XX8) 48 
error recording cylinders (areas) 

(5748zXEl) ^^ 
OBR records (5748::XX8) 53 

OBR records (5748-XE1) 53 

record ID code 



3310 (5 74 8- XX 8) 60 
3310 (5748-XE1) 60 
337 (574 8-XX8) 60 
3370 ( 5 74 8- X El) 60 
record modification for error recording 

(574 8-XX8) '^2 

record modification for error recording 

( 5748-XE 1) 42 
YHFDUHP 

sample output (5748- XX 8) 108 

sample output ( 5748-XE 1) 108 

formats, error record (see record, 

formats) 
frames, SRF (Service Record File) , 

description 48 
FRIEND 

OLTS 28 

sample printout 28 



G privilege class 14 



H 

hardware maintenance 

virtual machine, overview 1 
VM/370 essential requirements 9 
hardware problem analysis 

from a queued system task, 

advantages/disadvantages 2 
from the dedicated real system, 

advantages/disadvantages 1 
from the virtual machine, 
advantages/disadvantages 3 
header 

layout, error record 58 

record, error, source of data 59 



inboard error recording (see Recovery 

Management Support (RMS) ) 
input, line editing 23 
intensive recording mode 14,52,98 
I/O 

devices, specifying, error recording 98 
environmental data recording 50 
2305 control block linkage 51 
3330/3340/3350 control block linkage 
51 
error recording 

permanent error 50 

structure for sense byte analysis 45 
errors (see also hardware problem 
analysis) 

control blocks linkage I/O retry 46 
DASD error conditions 53 
discussion 37 
intensive recording 52 
maintenance from a virtual machine, 

statistical evaluation 4 
message to operator 44 
recovery, detailed description 44 
SDR recording 49 
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operations 

control block linkage 44 
CP 37 

virtual machine 37 
testing 23,28,30 
minidisk 24 
terminals 25 
IPCS (Interactive Problem Control System) , 
CE usage 13 



L 
line 

devices supported by \rM/370 18 
editing 23 

terminal facility check 22 
transmission 
codes 18 
tables 18 
line control units 18 
line delete, logical edit symbol 23 
LINK command, use for testing CP-owned 

volumes 24 
log records, SML 104 
logging errors, determination for error 

recording 38 
logical line delete 23 
logical line end, edit symbol 23 
logon 

a prerequisite for testing 21 
correspondence versus EBCD/PTTC codes 

20 
failure 22 

messages 23 
successful 22 
logout, storage assignment 75 



M 

machine check 

effect on virtual storage preservation 
(57 48 -XX 8) 60.1 

effect on virtual storage preservation 
(SlMzOl) 60.1 

error record layout 56 

hard, recovery 61 

soft, error recovery 62 
machine check handler (see MCH (machine 

check handler) ) 
malfunction (see errors) 
MCH (machine check handler) 

description 66 

function 66 

overview 60,66 

reaction to error 67 

summary 68 

with attached processor application 68 
MESSAGE command 

sample printout 33 

use as an aid to logon 22 

used for terminal checkout 32 
messages 

error, CCH, a referral 74 

to operator 44 
minidisk testing 24 
mode 

intensive recording 14 

system recovery (see recovery, mode) 



modifications, error record, SVC 76 42 

MONITOR command, trace data 106 

MSG command (CP) (see MESSAGE command) 



N 

NETWORK command, TRACE operand, brief 

description 103 
nonstandard record layout, error record 

notional conventions, brief description 82 





OLTS (Online 
example of 
invoking 
maintainin 
test runs 
testing th 
virtual ma 
environme 
OLTSEP (Onlin 
Program) 
initializa 
maintainin 
OLTS (see 
OLTSEP-RETAIN 
OLTS/FRIEND 
sample pri 
testing, o 
Online Test S 
Test Section 
Online Test S 
(see OLTSEP 
Executive Pr 
operands 
CPEREP 
' brief d 
require 
CPEREP for 
operating sys 
operator (se 
OS/VS EREP ( 



Test Sections) 25 

printout 25 
22,24 
g 9 

from the virtual machine 4 
e virtual console 27 
chine vs standalone system 
nt, test results analysis 4 
e Test Standalone Executive 

tion 11,22 
g 9 

OLTS (Online Test Sections) ) 
, invoking 30 
28 

ntout 28 

perator assistance 28 
ections (see OLTS (Online 
s) 

tandalone Executive Program 
(Online Test Standalone 
ogram)) 



escription 85-89 

d for reports 91 

mat 83-84 

tem, recognition by SVC 76 41 

e system operator) 

see EREP) 



parameters, passing 41 

permanent, I/O error, error recording 50 

pound symbol (#) , line edit use 23 

prerequisites, OLTS 21 

printout 

sample of ECHO 34 

sample of FRIEND 28 

sample of MESSAGE usage 33 

sample of OLTS 25 

sample of RETAIN 31 
privilege classes, CE's 16 
problem analysis (see hardware problem 
analysis) 

from the virtual machine 3 
processor errors, VM/370 recovery 61 
processor retry 37,67 
protection key errors, error recovery 62 
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QUERY, comaand (CP) , example of use 28 
quiet recording mode 71 



B 

real machine vs virtual machine, hardware 

maintenance 1 
recording, intensive mode 52 
recording modes (see also SET command (CP) 

RECORD operand) 
recording of error records, type recorded, 

YM/370 vs 0S/VS2 81 
records, format, error recording 53 
recovery (see error recovery) 
Recovery Management Support (RMS) 

damage assessment 68 

objectives 66 

summary of functions 66 

uncorrectable errors, machine check 69 

7M/370 support 66 
relationship, CE/system operator 13 
requirements, for testing virtual machine 

11 
reset, intensive recording 99 
restart 

after system damage 64 

operator-initiated 68 
RETAIN 

procedures, invoking 30 

sample printout 31 
retry 

processor 37,67 

via SET MODE command 99 
RMS (see Recovery Management Support) 
RSCS (Remote Spooling Communications 

Subsystem) , tracing the line 103 
rules, CPEREP operand entry 91 



SDR (Statistical Data Recorder) 
record layout 55 
VM/370 usage 47 
SDR recording, initiated by SHUTDOWN 49 
security, data (file) 12 
sense data, DASD environmental recording 

50 
Service Record File (see SRF) 
SET command (CP) 
MODE operand 

description 99 
threshhold count 100 
usage 62,70 
use 99 
RECORD operand 

description 98 
examples 99 
usage 52 
SML (Spool MULTI-LEAVING), log records 104 
soft errors 

count control 70 

explanation 68 

limiting 70 

recording at system initialization 70 



SRF (Service Record File) 
access to 48 
frames, description 48 
starter system 21 
storage, assignments, logout 
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CP nucleus 62 
system recovery 61 
STORE, command, example of use 
supported systems, SVC 76 40 
SVC 76 

description 40 
error record modifications 
type DDR 43 
type MDR 43 
type MIH 43 
type OBR 42 
type program abend 42 
error recording 63 
requirements 38 
functions 40 

handling of channel errors 
operating system recognition 
parameter passing 41 
systems support 40 
system 

configuration 
minimum 21 
21 



75 



28 



42 



41 



41 



starter 
dump 106 
repair 68 
termination 



67 



system console functions, CP command 
equivalency 3 

system damage 

attached processor recovery 65 
attached processors, affinity reset 
system restart facilities 64 

system operator 
aid from 13 
assistance 13 
/CE relationship 13 

SYS1.L0GREC (see error recording) 



65 



24,28 



18 
18 



terminals 

address alterations 

check 

via ECHO 34 
via MESSAGE 32 

supported by VH/370 

transmission codes 
termination, system operation (see fatal 

errors) 
test (s) 

diagnostics (see OLTSEP (Online Test 
Standalone Executive Program)) 

line transmission code 19 

MESSAGE command 23 

minidisk 24 

system check, basic 22 
testing, from a virtual machine 11 
threshhold count, SET MODE 100 
trace 

NETWORK command, brief description 103 

RSCS line 103 
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TRACE coimand (CP) (see also NETWORK 
command TRACE operand) 

described 101 

invoking, examples 101 

output 101 

printout segment 102 
trace table, MONITOR coimand 106 



uncorrectable errors 
machine check 

attached processor action 69 
uniprocessor action 69 
system action 69 
unit check 

{see also error handling) 
error record layout 55 
error record layout (short form) 55 
error recordings for 2305, 3330, 3340, 
3350 53 
user 

terminals 18 

termination, error recovery 67 
user class required, CPEREP 82 



virtual console, testing 26 
virtual machine 

a tool for I/O problem analysis, 
statistical evaluation U 

as troubleshooting aid 1 



CE»s 



3,5 



38 



capabilities/limitations 

protective features 3 
error recording 39 
error recording cylinders (areas) 
I/O error recovery 63 
results of uncorrectable errors 69 
the CE's lU 
Virtual Machine Facility/370 (VM/370) 
error recording, differences 39 
online message, 3704/3705 22 
recovery features, channel check handler 
60 



virtual machine facility/370 (VM/370) 
recovery features 

channel check handler (5748-XX8) 

60.1 
channel check handler (5748^111) 
60.1 
Virtual Machine Facility/370 (VM/370) 

recovery features, machine check handler 

60 
support, RMS 66 
system, recovery 67 
virtual storage preservation 

effect of channel check on (5748^ XX8) 

61 
effect of channel check on (5748-XE1) 

61 
effect of machine check on (5748-XX8) 

60.1 
effect of machine check on (5748-XE1) 
60.1 
VMFDDMP command 

description 106 
sample of output 108 
sample output (5748-XX8) 108 
sample output (5748-XEl) 108 
VMSAVE 

effect on virtual storage preservation 
channel check (5748-XX8) 61 
channel check ( 5748 -XEI) 61 
machine check (5748-XX8) 60.1 
machine check (5748-XEI) 60.1 
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wait state codes 74 



3277, OLTS testing 26 

3310 (see also fixed block devices) 
record ID code (5748- XX8) 60 
record ID code (5748-XEI) 60 

3370 ( se e also fixed block devices) 
record ID code (574 8- XX 8) 60 
record ID code (5748-XEI) 60 

3704/3705, VM/370 online message 22 
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